An Online Convex Optimization Approach to Blackwell's Approachability
Author
Abstract
The notion of approachability in repeated games with vector payoffs was introduced by Blackwell in the 1950s, along with geometric conditions for approachability and corresponding strategies that rely on computing steering directions as projections from the current average payoff vector to the (convex) target set. Recently, Abernethy, Bartlett and Hazan (2011) proposed a class of approachability algorithms that rely on the no-regret properties of Online Linear Programming for computing a suitable sequence of steering directions. This is first carried out for target sets that are convex cones, and then generalized to any convex set by embedding it in a higher-dimensional convex cone. In this paper we present a more direct formulation that relies on the support function of the set, along with suitable Online Convex Optimization algorithms, which leads to a general class of approachability algorithms. We further show that Blackwell’s original algorithm and its convergence follow as a special case.
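As a rough illustration of steering directions produced by an online algorithm, the sketch below uses a standard textbook instance rather than the paper's own construction: regret minimization over K actions, where the vector payoff is the expected regret vector and the target set S is the nonpositive orthant. For this S the support function is 0 on nonnegative directions and infinite elsewhere, so the direction w is kept in the set {w >= 0, ||w|| <= 1} and updated by projected online gradient ascent. The names `project` and `run`, the step size, and the random loss sequence are all assumptions made for the example.

```python
import math
import random

def project(w):
    """Euclidean projection onto {w >= 0, ||w||_2 <= 1}: clip, then rescale."""
    w = [max(0.0, x) for x in w]
    norm = math.sqrt(sum(x * x for x in w))
    return [x / norm for x in w] if norm > 1.0 else w

def run(T=2000, K=3, seed=0):
    """Return the distance of the average payoff vector from the nonpositive orthant."""
    rng = random.Random(seed)
    eta = 1.0 / math.sqrt(K * T)   # assumed fixed step size for gradient ascent
    w = [0.0] * K                  # steering direction, starts at the origin
    total = [0.0] * K              # running sum of payoff vectors
    for _ in range(T):
        # Mixed action proportional to w makes <w, r> = 0 for every loss vector,
        # which matches the support function value h_S(w) = 0 for w >= 0.
        s = sum(w)
        p = [x / s for x in w] if s > 0 else [1.0 / K] * K
        # Hypothetical opponent: i.i.d. uniform losses in [0, 1].
        loss = [rng.random() for _ in range(K)]
        exp_loss = sum(pi * li for pi, li in zip(p, loss))
        # Vector payoff: expected regret against each fixed action.
        r = [exp_loss - li for li in loss]
        total = [a + b for a, b in zip(total, r)]
        # Online gradient ascent on the linear function w -> <w, r>.
        w = project([wi + eta * ri for wi, ri in zip(w, r)])
    avg = [a / T for a in total]
    # Distance to the nonpositive orthant is the norm of the positive part.
    return math.sqrt(sum(max(0.0, x) ** 2 for x in avg))

dist = run()
print(dist)
```

Under these assumptions, the usual regret bound for projected gradient ascent suggests the distance should not exceed sqrt(K/T) for any loss sequence, since the direction set has radius 1 and each payoff vector has norm at most sqrt(K).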
Similar resources
Lecture: Blackwell's Approachability Theorem
Counter-example to (.): If S = {(p,q) : p = q} and r(p,q) = (p,q), one can trivially, for all q, choose p(q) = q to guarantee that r(p(q),q) ∈ S. It is however not possible to find a p which works for all q, indeed the only p which works for a given q is p = q. However, the duality statement (.) holds when S is a half-space {x | v ·x ≥ c}. To see this, define a zero-sum game with scalar p...
Zero-Sum Games with Vector-Valued Payoffs
In this lecture we formulate and prove the celebrated approachability theorem of Blackwell, which extends von Neumann's minimax theorem to zero-sum games with vector-valued payoffs [1]. (The proof here is based on the presentation in [2]; a similar presentation was given by Foster and Vohra [3].) This theorem is powerful in its own right, but also has significant implications for regret minimiz...
Online Learning and Blackwell Approachability with Partial Monitoring: Optimal Convergence Rates
Blackwell approachability is an online learning setup generalizing the classical problem of regret minimization by allowing for instance multi-criteria optimization, global (online) optimization of a convex loss, or online linear optimization under some cumulative constraint. We consider partial monitoring where the decision maker does not necessarily observe the outcomes of his decision (unlik...
A Learning Scheme for Blackwell’s Approachability in MDPs and Stackelberg Stochastic Games
The notion of approachability was introduced by Blackwell ([8]) in the context of vector-valued repeated games. The famous ‘Blackwell’s approachability theorem’ prescribes a strategy for approachability, i.e., for ‘steering’ the average vector-cost of a given player towards a given target set, irrespective of the strategies of the other players. In this paper, motivated from the multi-objective...
A Learning Scheme for Approachability in MDPs and Stackelberg Stochastic Games
The notion of approachability was introduced by Blackwell [1] in the context of vector-valued repeated games. The famous ‘Blackwell’s approachability theorem’ prescribes a strategy for approachability, i.e., for ‘steering’ the average vector cost of a given agent towards a given target set, irrespective of the strategies of the other agents. In this paper, motivated by the multi-objective optim...
Journal title:
- Journal of Machine Learning Research
Volume 17, Issue
Pages -
Publication date: 2016